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Abstract —Recently, the way people consume video content has 
been undergoing a dramatic change. Plain TV sets, that have 
heen the center of home entertainment for a long time, are 
losing grounds to Hybrid TV’s, PC’s, game consoles, and, more 
recently, mobile devices such as tablets and smartphones. The 
new predominant paradigm is: watch what I want, when I want, 
and where I want. 

The challenges of this shift are manifold. On the one hand, 
broadcast technologies such as DVB-T/C/S need to be extended 
or replaced by mechanisms supporting asynchronous viewing, 
such as IPTV and video streaming over best-effort networks, 
while remaining scalable to millions of users. On the other 
hand, the dramatic increase of wireless data traffic begins to 
stretch the capabilities of the existing wireless infrastructure 
to its limits. Finally, there is a challenge to video streaming 
technologies to cope with a high heterogeneity of end-user devices 
and dynamically changing network conditions, in particular in 
wireless and mobile networks. 

In the present work, our goal is to design an efficient system 
that supports a high number of unicast streaming sessions in a 
dense wireless access network. We address this goal by jointly 
considering the two problems of wireless transmission scheduling 
and video quality adaptation, using techniques inspired by 
the robustness and simplicity of Proportional-Integral-Derivative 
(PID) controllers. We show that the control-theoretic approach 
allows to efficiently utilize available wireless resources, providing 
high Quality of Experience (QoE) to a large number of users. 


I. Introduction 

Not so long ago, video content was largely distributed 
through a small number of broadcast television stations. 
Viewers passively consumed prescheduled content on a single 
available device - the television set. Things began to change 
in the seventies and eighties with the appearance of affordable 
video recorders that allowed to time-shift video programming, 
decoupling it from broadcasters’ schedules. Video recorders 
also enabled consumption of prerecorded movies available 
through rental or sale. Meanwhile, few decades later, we are 
starting to observe a dramatic shift of video consumption from 
plain TV sets, that have been dominating home entertainment 
for half a century, to devices such as Hybrid TV’s, personal 
computers, game consoles, and, more recently, mobile devices 
such as tablets and smartphones. This shift is accompanied by 
a new mindset: watch what I want, when I want, and where I 
want. 

The implications of this shift are manifold. Due to the asyn¬ 
chronicity of viewing patterns, broadcast-based distribution 


schemes need to be extended or replaced by multicast and 
unicast schemes, while the latter need to scale to millions of 
simultaneous viewers. In addition, an increasing fraction of the 
video content is distributed via wireless networks, including 
mobile networks, leading to a dramatic increase of wireless 
data traffic. Thus, traffic from wireless and mobile devices will 
exceed traffic from wired devices by 2018, accounting for 61 
percent of the total Internet traffic. And by far the largest part 
of it is video. Globally, video traffic will be 79 percent of 
all consumer Internet traffic in 2018, up from 66 percent in 
2013 11 . This development stretches the capabilities of the 
existing wireless infrastructure to its limits. Moreover, due to 
the extreme variability of network conditions on wireless and 
especially mobile networks, video streaming technologies are 
confronted with a challenge to deal with a high heterogeneity 
not only of end-user devices but also of dynamically changing 
network conditions El, El. 

It is well understood that the current trend of cellular 
technology (e.g., Long-Term Evolution (LTE) ||4l) cannot cope 
with the traffic increase caused by various new video services, 
unless the density of the deployed wireless infrastructure is 
increased correspondingly. In fact, throughout the history of 
wireless networks, throughput gains resulting from increased 
network density exceeded the gains from individual other 
factors by an order of magnitude Q- This motivates the recent 
flurry of research on massive and dense deployment of base 
station antennas, either in the form of ’’massive Multiple Input 
Multiple Output (MIMO)” solutions (hundreds of antennas at 
each cell site) IS or in the form of very dense small-cell 
networks (multiple nested tiers of smaller and smaller cells, 
possibly operating at higher and higher carrier frequencies) Q. 
If supplied with sufficient storage capacity, these technologies 
can also help reducing the load on the backhaul (i.e., the wired 
network connecting the access network to the Internet), which 
have recently become a bottleneck in cellular networks El. 

At the same time, streaming over best-effort networks, that 
were not designed to provide stable Quality of Service (QoS), 
especially over wireless networks, makes it highly inefficient 
or entirely impossible to use the same representation of a video 
for the duration of a streaming session. Instead, it must be 
adapted to dynamically varying network conditions such as 
throughput, packet loss rate, and delay jitter. A user might, 
e.g., experience continuous throughput fluctuations ranging 
from tens of kilobits to tens of megabits per second. Even 
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a static user in an indoor residential or office Wireless Local 
Area Network (WLAN) is typically exposed to interference, 
cross-traffic, and fading effects. The link quality fluctuations 
are even stronger in the case of mobile users. Thus, it is 
necessary to continuously adapt the QoS requirements of the 
video in order to achieve a satisfactory QoE. Recent studies 
suggest that these challenges have not yet been successfully 
addressed. In 2013, around 26.9% of streaming sessions on the 
Internet experienced playback interruption due to rebuffering, 
43.3% were impacted by low resolution, and 4.8% failed to 
start altogether 0. 

In the present study we focus on designing an efficient 
mechanism to support a high number of parallel streaming 
sessions in a wireless access network, such as a small¬ 
cell network. In order to maximize performance, we jointly 
consider the two problems of wireless transmission schedul¬ 
ing and video adaptation. We assume HTTP-Based Adaptive 
Streaming (HAS) as streaming technology 0, m, which 
is a client-controlled adaptive streaming approach, that uses 
Hypertext Transfer Protocol (HTTP) and Transmission Control 
Protocol (TCP) as its application layer and transport layer pro¬ 
tocols, respectively, and that has been deployed with increasing 
success in various large-scale streaming solutions. 

Inspired by the analytical tractability of PID controllers, 
complemented by their powerful features, we aim at designing 
a joint transmission scheduling and video adaptation mecha¬ 
nism that resembles dynamics of a system under PID control. 
A PID controller is typically used to stabilize a dynamic 
system around a given target state. Its strength lies in the 
ability to do so in the presence of model uncertainties (that is, 
the system parameters are not completely known and might be 
time-varying) and disturbances (unknown, potentially random, 
inputs to the system). In our case, we use it to stabilize 
users’ playback buffers around certain target values, in the 
presence of dynamically changing network conditions due to 
users arrival and departures, mobility and fading effects. 

We evaluate the developed approach by means of simu¬ 
lations in different deployment scenarios, such as long-term 
users with low user churn, short-term users with high user 
churn, and a mix of short-term and long-term users. We use 
QoE-related performance metrics such as total rebuffering 
time, start-up delay, average media bit rate, media bit rate 
fluctuations, and media bit rate fairness. 

The rest of the paper is structured as follows. Section |II] 
presents the related work. Section |III] describes the streaming 
model and the wireless network model underlying our study. 
Section |IV] presents our approach to joint scheduling and 
quality selection. Section |V] introduces the evaluation setting 
and results. Einally, Section |VT] concludes the paper. 

H. Related work 

Over the past few years, adaptive video streaming has 
gained a lot of attention from the research community. The 
focus is typically on optimizing user’s QoE, complemented 
by a differently weighted mix of fairness, resource efficiency, 
energy and cost factors. 

Many studies adopt the perspective of a streaming client, 
while modeling the network environment as a black box im- 


ini. In contrast to these studies, we jointly optimize video 
quality selection and network resource allocation in a dense 
wireless networks, in a distributed way. 

A number of studies specifically focus on video transmis¬ 
sion over wireless networks, leveraging cross-layer techniques 
that jointly perform video quality selection and network re¬ 
source allocation, for different types of wireless networks. 
In most such studies, video quality selection is performed 
in a centralized way lfT8ll - ll24l . In contrast, we assume a 
client-driven approach, where every streaming client performs 
adaptation individually and asynchronously w.r.t. other clients, 
which is inline with the HAS streaming model a, m. 
Moreover, while these studies focus on a setting with a single 
base station, we consider a small-cell network a with a dense 
base station deployment and a bandwidth reuse factor of 1. 
This setting is considered one of the candidate solutions to 
cope with the recently observed dramatic increase of wireless 
and mobile traffic. 

Streaming technologies using unreliable transport protocols 
have received a lot of attention in the past 100 . EOll . ||23]| . 
Il25l . In contrast, our study assumes TCP as transport protocol, 
which is reliable and thus requires a different QoE model 
since, on the one hand, it prevents packet losses but, on the 
other hand, exhibits increased throughput fluctuations due to 
built-in congestion avoidance and congestion control mecha¬ 
nisms ll26l . Moreover, due to TCP’s end-to-end, connection 
oriented semantics, it does not allow in-network manipulation 
of packets belonging to a particular video stream. 

Eurther studies focus on adapting the playout rate instead 
of video quality, in order to deal with dynamically varying 
network conditions ET). 

Using control-theoretic models in adaptive streaming is not 
a new idea. In CD, e.g., the authors design a video adapta¬ 
tion strategy which is based on a Proportional-Integral (PI) 
controller, with the goal to combat video quality oscillations 
they observe when multiple video clients share a common 
bottleneck. In 123, a controller with only a proportional com¬ 
ponent is used to stabilize a User Datagram Protocol (UDP)- 
based streaming system. Likewise, our work is inspired by the 
simplicity and power of a PID controller. Differently, however, 
we treat both transmission scheduling in the wireless access 
network and video adaptation as a single system, which has 
the potential to further improve the performance. In addition, 
we deploy an anti-windup scheme to stabilize the controller 
despite of saturation, a different discretization approach, and 
several heuristics aiming at further improving users’ QoE. 

Similar in spirit to our work, in ll28l . the authors address 
the goal of designing a joint transmission scheduling and video 
rate adaptation scheme using a Network Utility Maximization 
(NUM) framework, using the drift-plus-penalty method. The 
approach maximizes the sum of users’ utilities which are 
functions of the long-term average video qualities. A drawback 
of this approach is the coupling of transmission scheduling 
slot duration and video segment size, requiring large start-up 
delays to combat buffer underruns. 

Another hot and challenging topic is QoE for adaptive video 
streaming, see e.g. ll2^ - ll^ . Quality adaptation, rebuffering, 
start-up delay, and quality fluctuations are factors that have not 
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been part of traditional QoE metrics for video, but that have 
a tremendous impact on user’s perception of adaptive video 
streamed over a best-effort network, such as the Internet. User 
engagement is another important metric, which is especially 
of interest for content providers since it is directly related to 
advertising-based revenue schemes l(34l . 

III. System description 

In this section we describe the streaming model and the 
wireless network model used in our study. 

A. Streaming model 

We consider a set of users u G Ut at time t, where each 
user wants to watch a video file from a library of possible 
files. Corresponding to the HAS model ||9l, each video file is 
segmented in chunks of r seconds duration, and each segment 
is encoded in several representations, each representation 
providing a different average encoding bit rate. Further, each 
segment starts with a random access point of the stream, 
thus allowing a video client to concatenate segments from 
different representations during the playback. A video client 
sequentially issues HTTP GET or GET RANGE requests to 
download individual segments. The meta information about 
available segments and representations is downloaded by the 
client prior to starting the streaming session, e.g., in form of an 
XML file, called Manifest or Media Presentation Description 
(MPD). An example for this technology is the open standard 
MPEG-DASH ifT^ . 

If a segment is not downloaded until its playback deadline, 
a buffer underrun occurs, typically followed by a rebuffering 
period, during which the playback is halted. In addition, a 
video client might initially delay the start of the playback in 
order to prebuffer a certain amount of video (start-up delay). 

The goal of a video client is to maintain a high QoE by 
adapting the media bit rate to the available network through¬ 
put. Among the main factors influencing the QoE are the (1) 
amount of time spent in rebuffering and the distribution of 
rebuffering event over the streaming session, (2) the average 
and minimum video quality during the streaming session, (3) 
the number and distribution of quality changes, and (4) the 
duration of the start-up delay. The exact nature of QoE for 
adaptive streaming is an ongoing research problem, see also 
Section m 

B. Wireless network model 

We consider a wireless network with multiple user nodes 
and base stations sharing the wireless resource. The base 
stations either store cached video files, or have a (wired or 
wireless) connection to some video server, which we assume 
not to be the system bottleneck. In general, some user nodes 
might also participate in distribution of video data; therefore, 
we generally refer to base stations and user nodes serving as 
streaming sources as ’’helpers”. 

The network is defined by a bipartite graph Gt = 
{Ut, H, Et), where t is the time index, Ut denotes the set 
of users, H denotes the set of helpers, and Et contains 


edges for all pairs [h, u) G H x Ut for which there exists 
a potential transmission link between h and u at time t. We 
denote by A/'t(u) = {h G H : {h, u) G Et A Chu{t) > Cmin} 
the neighborhood of user u at time t. Similarly, J\ft(h) = 
{uGUt '■ {h,u) G Et A Chu{t) > Cmin}- In the following, we 
omit the index t to simplify the notation. 

Although the approach presented in this paper works with 
different kinds of wireless network models, we will focus 
on the wireless network model used in ll28l , as described 
in the following. The wireless channel for each link (h, u) 
is modeled as a frequency and time selective underspread 
fading channel llTSl . Using Orthogonal Frequency-Division 
Multiplexing (OFDM), the channel can be converted into a set 
of parallel narrowband sub-channels in the frequency domain 
(subcarriers), each of which is time-selective with a certain 
fading channel coherence time. The small-scale Rayleigh 
fading channel coefficients can be considered as constant over 
time-frequency ’’tiles” spanning blocks of adjacent subcarriers 
in the frequency domain and blocks of OFDM symbols in the 
time domain. 

We assume that transmission scheduling decisions are made 
according to the underlying PHY and MAC air interface 
specifications. For example, in an LTE setting, users are 
scheduled over resource blocks which are tiles of 7 OFDM 
symbols x 12 subcarriers, spanning (for most common channel 
models) a single fading state in the time-frequency domain. 

Nevertheless, it is unreasonable to assume that the rate can 
be adapted on each single resource block. As a matter of 
fact, rate adaptation is performed according to some long-term 
statistics that capture the large-scale effects of propagation, 
such as distance dependent path loss and interference power. 
It is well-known that with a combination of rate adaptation 
and hybrid ARQ, a link rate given as the average with respect 
to the small-scale fading of the instantaneous rate function, for 
given large scale pathloss coefficients and interference power, 
can be achieved. This averaging effect with respect to the 
small scale fading is even more true when massive MIMO 
transmission is used, thanks to the fact that, due to the large 
dimensional channel vectors, the rate performance tends to 
become deterministic 0, Eg). Therefore, in our treatment 
we shall use the link rate function given by eq. ([TJ. 

We assume that the helpers transmit at constant power, and 
that the small-cell network makes use of universal frequency 
reuse, that is, the whole system bandwidth is used by all helper 
nodes. We further assume that every user u, when decoding a 
transmission from a particular helper h G M{u) treats inter¬ 
cell interference as noise. Under these system assumptions, 
the maximum achievable rate during the scheduling timeslot 
ti for link (/i, u) G E is given by 

Ph9hu{l'i') \^hu\ j 

1 + '^h'GAf{u)\{h} ^h'9h’u{ti) \sh>u\ ) 

( 1 ) 

where is the transmit power of helper h, Shu is the small- 
scale fading gain from helper h to user u, ghu{ti) is the slow 
fading gain (path loss) from helper h to user u, and W is 
system bandwidth. 


lU-E 


log 14- 
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Consistently with current wireless standards, we consider 
the case of intra-cell orthogonal access. This means that 
each helper h serves its neighboring users u G J\f{h) using 
orthogonal FDMA/TDMA. We denote by ahu{ti) the fraction 
of time/spectrum helper h uses to serve user u in scheduling 
timeslot U. It must hold J2u£Af{h) othuiti) < 1 for all h G H. 
The throughput of user u in timeslot ti is then given by 
Cu{ti) = J2hGAfiu) oihuiti)chuiti). The underlying assump¬ 
tion, which makes this rate region achievable, is that helper h 
is aware of the slowly varying path loss coefficients ghu{ti) 
for all u G M{h), such that rate adaptation is possible. This is 
consistent with rate adaptation schemes currently implemented 
in LTE and IEEE 802.11 0, Illl, IMl- 

We assume that all helpers are connected to a cenralized 
network controller, which performs the scheduling decisions. 

IV. Joint scheduling and quality selection 

In this section we present a partially distributed mecha¬ 
nism consisting of two components: video quality selection, 
performed by each video client independently and asyn¬ 
chronously, and link transmission scheduling, performed by 
a centralized network controller at equidistant time intervals. 

A. General idea 

Consider a dynamic system described by the following 
equations 

x{t) = fi {x{t)) + z{t) 

y{t) = h , 

where x{t) is system state at time t, z{t) is input to the system, 
y{t) is system’s output, and /i(-) and / 2 (-) are some functions. 
Assume that we want to stabilize system’s output y{t) around a 
certain target value y*. Eor many systems, this can be achieved 
in an efficient way by setting their input using a PID controller, 
that is, 

z{t) = Kpe{t) + Kde{t) + Ki f e{^)d^ , 

JQ 

where the error function e{t) = y{t) — y* is the deviation of 
system output y{t) from the target value y*, and Kp, Kd, and 
Ki are controller parameters. 

In our case, we define system state as the vector of 
(playback) buffer levels x{t) = [xu{t),u G (7). The buffer 
level of a user is defined as the consecutive amount of video 
content, measured in seconds of playback time, stored in user’s 
playback buffer, starting from the current playback point. We 
denote by it) the encoding bit rate, or media bit rate, of the 
video content downloaded by user u at time t, measured in 
bit/s. Eurther, Cu{t) shall denote the throughput experienced 
by user u at time t, in bit/s. With this notation, the buffer 
level dynamics are represented by the following system of 
equations: 

iu(/) = ^-l, VmG(7, (3) 

where is the rate at which the buffer is filled by the 

arriving video data, and —1 is the rate at which the buffer is 
emptied by the playback. 


Particularizing the general dynamic system equation (|2]i to 
our specific case defined in @, we obtain fi {x{t)) = —1, 
/2 ix{t)) = x{t), and Zu{t) = That is, our goal is to 

stabilize users’ buffer levels x(t) around a target value x* by 
controlling In the following, to simplify notation and 

w.l.o.g., we set x* = 0. 

In order to apply PID control to buffer level dynamics (0, 
we have to set system’s input to the controller output 

^^4 = Kp^ux{t) -f Kd,ux{t) + Ki^u f XuiOd^ (4) 
Xu[t) Jq 

" -V-' 

= .Uln(t) 

for each user u G U. Consequently, the buffer level dynamics 
will obey the following differential equation 

Xu{t) = 0Ju{t) -1,'iuGU. (5) 


In the following we present a basic result on stability of 
system 0. Proposition[T]ensures that it converges to the target 
buffer level a;* = 0 for arbitrary initial values. 

Proposition 1: System 0 is globally asymptotically stable 
if and only if Kd^u ^ 1, i-kI ^ ® i-kI ^ 

Proof: In the following, we omit the user index u. 

Eirst, we observe that for Kd = 1 system 0 degenerates 
and is only solvable for zero initial value: a:(0) = 0. 

Assuming Kd ^ 1 and substituting y{t) = f* x(^)d^, we 
transform 0 into a system of linear differential equations 


'x{t) 


a b 

x(t) 


c 

y{i)_ 


1 0 



0 


with a = , , b = , , and c = 

1 —Aj’ 1 —Kd’ 


Ki 


1 

l-Kd' 


We proof the claim by explicitely constructing the general 
solution. We distinguish two cases. Case I: + 4b = 0. In 

this case, the solution for the initial value problem a;(0) = xq 
is given by 


x{t) = [xo + (0.5x0 — l)ai)e2“*. 


It converges to 0 for / —:> 00 if and only if o < 0. Eor case II: 
+ 4b 0, we obtain 


x{t) = 

dxp — axp -I- 2c Q dxp + axp — 2c q 

2d ^ 2d 


with d = sj 0 ? P 4b. It converges to 0 for / —00 if and only 
if Re{a + d) < 0 and Re{a — d) < 0, where Re{-) denotes the 
real part of a complex number. This, however, is equivalent to 
o < 0 and 6 < 0, proving the claim. ■ 

In real deployment scenarios, system’s input cannot 
always be set according to 0 due to several reasons that we 
list in the following, along with references to sections where 
we address them. 


The range of feasible values for the left-hand side of 0 


is bounded by 
throughput that 
and r„ 


0 , 


, where Cu 


is the maximum 


;"can be allocated to user u by the network, 
is the smallest available media bit rate. (The 
lower bound is attained if the download is paused. The 
upper bound is attained when the user downloads lowest 
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video quality at maximum available throughput.) This 
means that controller gain values outside of this region 
cannot be applied to the system. Such a controller with a 
limited gain is called saturated. It might become unstable 
due to the problem called integral windup. We address 
this issue in Section HV-BI 

• In a real deployment scenario, the system we consider is 
a sampled and distributed system. Both Cu{t) and ru{t) 
cannot be adapted in continuous time. They can only 
be modified at certain time instants, and after that they 
are kept constant for a certain period of time. Moreover, 
both transmission scheduling and quality selection take 
place at different time instants, since quality selection 
takes place once per video segment, while transmission 
scheduling takes place once per scheduling time slot. We 
address this issue in Section HV-CI 

• Lastly, stabilizing buffer level around a target value is 
not enough to ensure high QoE. It primarily aims at 
avoiding buffer underruns, but, in addition, we also would 
like to avoid excessive video quality fluctuations and 
unfairness among individual users. From the perspective 
of buffer level stability, assigning a user low throughput 
and low video quality stabilizes his buffer level as good 
as assigning him high throughput and high video quality, 
whereas from the perspective of QoE the second situation 
is clearly preferred (see, e.g., 1^ for the impact of 
quality fluctuations on QoE). We will address these issues 
in Sections IIV-DI and IIV-EI 

Finally, it is worth noting that it might make sense to define 
different target levels a:* for different users, in order to account 
for their individual mobility patterns, link statistics, and QoE 
expectations. We may imagine a specific per-user adaptation, 
e.g. at application layer, that acts on the control parameter x*. 
In the present work we do not address this outer control, and 
assume a common target x* for all users. 

Also note that in general, there are several reasons for 
keeping x* at a reasonably low level. Especially in the case of 
short videos, if a user prematurely quits the streaming session, 
the bandwidth used to download video data remaining in the 
buffer would have been wasted, engendering costs for content 
providers and network operators jm, iOl. Further, if a user 
downloads certain parts of a video in a low quality, e.g. due 
to poor network throughput, he might partially discard this 
data when network throughput increases, leading to a waste 
of bandwidth as in the previous case. Finally, in the case of 
a live stream, video content becomes available incrementally, 
imposing a trade-off between the value of x* and the viewing 
delay. 


B. Integral windup 

As mentioned in the previous section, in practice, it is not 
always possible to set control variables Cu{t) and r„(f) such 
that equality (|4|i is satisfied, due to limited link rate on the 
one hand, and a limited set of available media bit rates on the 
other hand. Our controller becomes a saturated controller. 

Saturated PID controllers have been subject of intensive 
research in the last decade. Partially, the attention has been 


motivated by control of robotic manipulators. One issue with 
saturated controllers is the so-called integral windup. For 
large deviations of x{t) from its target value, the unsaturated 
controller would apply a high positive or negative gain to bring 
x{t) back to X*. Due to the saturation, however, only a smaller 
gain can be applied. Thus, it takes more time to bring the 
state back to the target value. During this time, however, the 
error integral obtains a larger value than it would have had 
otherwise. The result is a higher overshoot and oscillations, 
leading to potential instability. 

To formalize the notion of saturation, we use the following 
notation 

{ ^min for X ^ Xmin 
a^max for X > Xmax 

X Otherwise 


With this definition, the saturated, and thus more realistic, 
version of system © can be written as 


Xu(f) = - 1, Vm € C/, (6) 

with 5inax > 1, and a;„(f) as defined in (|4|i. 

The following proposition, which leverages ideas and results 
from ED and Ea, shows that for small enough \Ki^u\, 
saturated system retains its global asymptotic stability 
property. 

Proposition 2: Assume that conditions of Proposition [Dare 
fulfilled. Then, for every set of initial values [xmin, a;max] with 
a^min < 0 < Xniax there exists a Ki^u > 0 such that for 
\Ki^u\ < Ki^u and x(0) S [xmin,a:max] the solution of the 
initial value problem (jb]) converges to 0 for t ^ oo. This 
property is sometimes called semi-global practical stability. 

Proof: Throughout the proof, we will omit the user index 
u. First, we rewrite (|6]l as 


x{t) 


ax{f) + b 


Pmax~l”C 1 

C-1 


- C, 


where a = , , b = , , and c = , V ■ Next, 

i—rSd ^ r-JSd r-JSd 

we substitute y{t) = b x{^)d^. We obtain the equivalent 

formulation 


x(f) = [ax{t) -b ^ - c 

y{t) = bx{t) 


Observe that the unique equilibrium of this system is (0, c). 
In order to shift the equilibrium to (0,0), we define x(f) = 
“ yW ~ obtain 

f x{t) = [ax{t) -b c] 3”“+'="^ - c -b 6 (ix(Q - ^y{t)) 
\y{t) = b{x{t) - ^y{t)) 

Next, we write the integral gain as Ki = eKi and substitute the 
time variable t' = ef, where e > 0. We define new variables 
x{t') = xif'je) and C,[t') = y{t'je) to obtain a ’’fast” version 
of our system 

f ex'(f') = [ax{t') + -c + eb - i^Cit')) 

\c'{t') = b{x{t')-],at')) 

( 8 ) 
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where b = Observe that (0, 0) is the equilibrium point 

of ([8]l. Further, observe that {x(t'), C(^0) (0i 0) as —?> oo 
implies that {x{t),y{t)) —?> (0, 0) as f —?> oo, and thus, stability 
results for (|8]l transfer to (|6|l. Further, for e <C 1, the system (l8]l 
is in the form of standard singular perturbation ll42ll . 

Now, it is relatively straightforward to validate that the 
conditions of Theorem 3 in ll42l are fulfilled for system ®, 
proving the claim. ■ 

Note that this result only guarantees stability if the in¬ 
tegral coefficient Ffi „ is small enough. However, making 
Ki^u smaller than necessary might have negative impact on 
convergence speed. In practice, it is difficult to compute the 
maximum value Ki^u that still ensures stability. Therefore, in 
the following, we present an alternative approach to solve the 
problem of integral windup: conditional integration. With this 
approach, the value of the integral part of the controller is not 
allowed to exceed certain hard limits. 

In particular, we use the equivalent form Q of the saturated 
system ©, and apply a bound t/i max on the error integral. We 
obtain 

x{t) = [aa::(<) + ^ “ c 

{ max (0, bx{t)) if y{t) < c - 
min (0, bx{t)) if y(t) >c + 
bx{t) otherwise 

where a, b, and c are as defined in the proof of Proposition |2] 
The resulting controller is then given by 

= aa;(f) + y(f), (10) 

ruit) 

with x{t), y{t) defined by @. The advantage of this formu¬ 
lation is that it accounts for the saturated gain, and limits the 
value of the error integral, reducing the impact of integral 
windup. An analytic study of the performance of this anti¬ 
windup strategy is notably complex, forcing us to resort to a 
simulative evaluation, presented in Section IV] 

Further potential anti-windup strategies include limiting the 
integral action of the controller from growing by keeping it 
constant whenever the controller enters saturation, adding anti¬ 
windup compensating terms to the integral action, etc. (see, 

e.g., my 

C. Sampled distributed system 

So far, we studied a system, where control decisions, that 
is, transmission scheduling and video adaptation, are made in 
continuous time. In practice, however, both control decisions 
take place at discrete time instances, while in between, the 
values of the control variables are fixed. In this section, we 
reformulate our approach to adapt it to this requirement. 

The challenge here stems from the fact that while we may 
assume that transmission scheduling takes place regurlarly, at 
equidistant time intervals, quality selection can only take place 
when a user starts downloading a new video segment, which 
happens for each user independently and, in general, on a 
different time scale than transmission scheduling. 

Moreover, time intervals between individual segment down¬ 
loads may be subject to considerable variation over time. 


Whenever the buffer level of a user is in equilibrium (that is, it 
stays around the target level for a certain period of time), the 
average duration of a segment download equals the duration 
of the segment itself. However, when buffer level is increasing 
or decreasing, the duration of a segment download might be 
subject to significant fluctuations. In addition, segment sizes 
might substantially deviate from representation averages, due 
to Variable Bit Rate (VBR) encoding used by modem com¬ 
pression technologies, causing further variations of download 
times. 

Let us for the moment assume that both control decisions, 
scheduling and adaptation take place simultaneously at time 
instants L, i = 0,1,... In order for the sampled system to have 
the same state as the continuous system at certain given time 
instants ti, we need to set our control variables as follows: 


iy'} 

ruit) 


xjti+i) - xjtj) 

ti-\-l ti 

' -.-^ 


Vf e 


[tij L-l-1)) 


( 11 ) 


where a;(L) is the buffer level at time ti and x{ti^i) is 
computed by solving the initial value problem defined by (|9ll. 

Proposition 3: Assume that conditions of Proposition [T] are 
fulfilled. Then, sampled control (fTTT i and continuous time sat¬ 
urated control with anti-windup ( fTOl l lead to identical system 
states at time instants ti, z S N. 

Proof: The claim is proven by substituting (fTTI) into ©, 
integrating the right hand side, and using xiti+i) that solves 
the initial value problem (|9]l. ■ 

In real deployment, however, Cu (t) and Vu (t) cannot be set 
simultaneously. Instead, we are dealing with a distributed sys¬ 
tem, where transmission scheduling and quality selection are 
performed independently from each other and at different time 
instants. In the following two sections, we present heuristics 
for controlling transmission scheduling and quality selection 
in a distributed way. 


D. Quality selection 

In the following, we present several heuristics that comple¬ 
ment the mechanisms presented in previous sections, so that 
we obtain a distributed, practically implementable approach. 
While this section covers the quality selection part, the fol¬ 
lowing Section HV-EI covers transmission scheduling. 

The idea we use to organize operation of our controller 
in a distributed way is the following. The network on the 
one hand and each individual user on the other hand shall 
try to maintain the equality (fTlTi every time they adapt their 
respective decision variable. The network does so at the 
beginning of each scheduling timeslot, while each user does 
so when he is about to request a new video segment. 

We denote by ts,u the time, when a user is about to start 
a segment download. We transform (fTTT l to obtain the quality 
selection rule 


( X _ Cujts^u) 

i -|- LOu\^s,U') ^f^u) 


( 12 ) 


where tf ^ is the time when the segment download would 
finish if the buffer dynamics would obey (lH). It can be 
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computed by solving = x{ts,u)+T-{tf^u-ts,u), with 

Xu{t) being the solution to the initial value problem defined 
by (|5]l. 

In addition, in order to avoid excessive quality fluctuations 
and provide a smooth adaptation trajectory, we use exponential 
moving average for quality adaptation. The network through¬ 
put Cu{ts,u) is approximated by a simple moving average of 
past throughput; 


ru{ts,u) = (l-a)ru{t^ J+a - . - - —r—, (13) 

-L “T ^f,u) 


where ru{tj^) is the previously selected video quality, a S 
(0,1), and T is a configuration parameter denoting the time 
period over which we compute the throughput average. 

Since only a finite set of media bit rates is available, we 
round ru{ts,u) down to the next available value: ru{ts,u) = 
max{r S i?„ | r < ruits,u)}- If {r € Ru \ r < r„(fs,u)} is 
empty, the lowest available media bit rate is selected. 

In order to account for the difference between the tar¬ 
get media bit rate ru{ts^u) and the actually selected me¬ 
dia bit rate fu{ts,u) < Xu{ts^u), we introduce a delay 
after the download of the current segment, computed as 
follows. If the media bit rate ru{ts,u) were actually avail¬ 
able and assuming user’s throughput Cu{ts,u) would remain 
constant, the buffer level after downloading the current seg¬ 
ment would be x{ts^u) ~ actu¬ 

ally selected media bit rate is smaller, however, the buffer 
level after the download will be larger. Thus, in order to 
account for the difference in media bit rates, we delay 
the subsequent request until the buffer level drops below 
max (|o, x{ts,u) - T, x(ts,u) “ (l + where the 

maximum operator is an additional precaution preventing the 
buffer from depleting more than the duration of one segment 
at a time, as well as from falling below the target level 0 as a 
result of a delayed request. 

During a particularly long period of low throughput the 
buffer may become empty, interrupting the playback. We call 
such an event a buffer underrun. In order to avoid multiple 
buffer underruns within a very short period of time, the 
client starts to play a segment only after it has been fully 
downloaded. This is called rebuffering. We limit the duration 
of the rebuffering period to the download time of one segment, 
that is, once we have at least one segment in the buffer, the 
playback is immediately restarted. 

Finally, at the begin of the streaming session when no 
throughput information is available, the client downloads the 
first segment in lowest quality in order to minimize start-up 
delay. 


problems. In the following, ru{ts,u) Cju{ts,u^tf,u) 
values computed by user u for the last segment requested 
prior to scheduling timeslot ti and communicated to the 
network as part of the download request. Further, we denote by 
Pu{ts,u) = the throughput demand 

of user It. 

First, the network controller maximizes the minimum frac¬ 
tion of user’s throughput relative to his demand, similar to the 
well-known maximum concurrent flow problem. At the same 
time, the controller tries to improve fairness by encouraging 
users streaming at low video quality to switch to higher quality, 
if there are sufficient network resources. This is achieved by 
artificially raising the demand pu{ts,u) of the 10% of the users 
with lowest demand to the 10th percentile across all users. We 
obtain the following optimization problem; 


max min 

Cli (ii') 

(TSl) 

max(p„(fs,u), PioiU)) 

uGU 

S.t. 

ahu < 1, VhG H 

(Cl) 


uGJ\fih) 

OLuh > 0 , Vh G H, Vu G U , 

(C2) 


where pio(fi) is the 10-percentile of {pu{ts,u),u G U). Recall 
that Cu(ti ) = J2hpAr(u)^huChu{ti). We denote the optimum 
value of dTSll l by 

In the second step, the network controller fixes the minimum 
relative allocated capacity to its optimum value r?*, and 
maximizes the minimum allocated capacity. 

max min 
uGU 

s.t. 


We denote the optimum value of ( ITS21 i by 

Finally, it fixes the minimum allocated relative capacity 
"d* and minimum allocated absolute capacity to and 

maximizes the total network throughput. In order to avoid 
high-amplitude throughput spikes for the individual users, 
it limits the capacity allocated to a user to either twice 
the median demand across all users or twice the minimum 
allocated capacity c^i„, whichever is larger. 


max Cu{ti) 

(TS3) 

ueu 


s.t. Cu{ti) ^ Vu G U 

(C4) 

CuiU) < 2max(c^in, PBoiU)), VuGU 

(C5) 

(Cl), (C2), (C3), 



Cu (ti ) 

< -7 .X 

ma,x{pu{ts,u), Pio[ti)) 

(Cl), (C2) 




(TS2) 

, Vug U (C3) 


E. Transmission scheduling 

The goal of the network is on the one hand to provide the 
capacities requested by the users, that is, to maintain (dUl. 
On the other hand, it shall allocate the remaining capacity, if 
available, in a fair manner in order to provide high network 
utilization and to eventually enable users to switch to a 
higher video quality. In order to achieve these goals, we let 
the network controller solve a series of linear optimization 


where p^oiti) is the median demand across all users. 

Since the maximum number of users per helper is limited 
by a technology dependent value, the number of optimization 
variables and constraints is O (max(|iF|, |(7|)), which can be 
handled very efficiently by modern linear program solvers 
even for large networks. Also note that most solvers allow 
to iteratively modify and reoptimize a model, which reduces 
the complexity of subsequent optimizations such as we have 
here. 











V. Evaluation 

In the following, we present our evaluation setting and 
results. All results were obtained by means of simulations. The 
simulation code was written in C++, we used Gurobi ll44ll to 
solve optimization problems, and we used odeint ||45]| to solve 
differential equations. 

Section lV-Al describes the performance metrics. Section lV^ 
describes the general setting, such as network and video 
parameters. Section IV-CI elaborates on the goal and design 
of the individual experiments. Finally, Section IV-DI presents 
evaluation results. 

A. Performance metrics 

We use the following metrics to assess the performance of 
the proposed system. 

1) Stability: A well-known issue with closed-loop control 
systems is their potential to become unstable, leading to 
high-amplitude fluctuations of the system state. Although not 
necessarily harmful per se, instability can have a dramatic 
impact on other performance metrics. We evaluate stability by 
means of buffer level statistics, such as the maximum buffer 
level overshoot and the minimum buffer level of a user during 
a simulation run. 

2) Rebuffering duration: When a client’s video buffer has 
been drained so that the next video segment does not arrive 
before his playback deadline, the playback must be halted. 
This is often refered to as a buffer underrun. A buffer underrun 
is followed by a rebuffering period, where the client waits 
until enough video data is accumulated in the buffer to resume 
playback. The conditions that need to be fulfilled before the 
playback is resumed depend on client’s rebuffering strategy. 
In our design, we resume playback after at least one segment 
is completely downloaded. In our evaluation we look at 
the cumulative rebuffering time a user experienced during a 
simulation run. 

3) Prebuffering duration (start-up delay): At the start of a 
streaming session, user’s video buffer is empty, so he has to 
wait until enough video data is downloaded to start playback. 
In contrast to rebuffering, a user at this state typically do 
not have information about network conditions. Especially 
when a user frequently starts a new streaming session, e.g., 
by switching TV channels, or when he repeatedly watches 
short videos, even a moderate start-up delay might severely 
degrade QoE and even make the user decide not to watch the 
video at all. 

4) Mean media bit rate: The mean video quality is obvi¬ 
ously a factor that dramatically influences the overall QoE, 
although studies have shown that it cannot be considered as a 
standalone QoE measure for adaptive video streaming. In our 
evaluation, we identify mean video quality with mean media 
bit rate during a simulation run. 

5 ) Media bit rate fluctuations: Recent studies have shown 
the significance of temporal quality fluctuations on the overall 
QoE. As a measure of quality fluctuations we use the fraction 
of segments played out in a different quality than the preceding 
segment. That is, a quality fluctuations measure of 1% means 
that one segment out of 100 was played in a different quality 


than its predecessor. For a segment duration of 2 seconds, e.g., 
this would mean one quality adaptation in 200 seconds. 

Note, however, that due to significant media bit rate fluctu¬ 
ations within a single representation, resulting from VBR en¬ 
coding used by most modern video compression technologies, 
quality fluctuations cannot always be completely avoided. 

6) Media bit rate fairness: Although a selfish user might 
not care about that, from the perspective of a service provider 
and/or system designer, a fare distribution of QoE among 
clients sharing a common bandwidth resource is essential 
for the overall system performance. In our evaluation, we 
computed video quality fairness as follows. Taking the set of 
mean video qualities of all users in one simulation run, the 
fairness index is defined as the interquartile range, that is, the 
distance between the 0.25 and the 0.75 quantiles. 

B. Evaluation settings 

In this section we describe general settings such as video 
and network parameters. 

1) Video related settings: For evaluation, we used a mix 
of 6 videos, contributed by the University of Klagenfurt ll46ll . 
For each video, we selected 6 representations, ranging from 
approximately 500 kbps to 4.5 Mbps. Segment duration was 
2 seconds. 

The target buffer level of the video clients was set to 20 

s. This value was shown to be sufficient to provide good 
performance in wireless networks in BtII . 

2) Network related settings: The simulated network spans 
an area of 50 x 50 meters, covered by 25 helpers, distributed 
on a uniform grid. The duration of a scheduling time slot is 
set to 10 ms. 

The path loss coefficients ghu (t) between helper h and user 
u are based on the WINNER II channel model ll4^ : 

9hu{t) = 10-o-iPL(d.„(0) ^ 

where dhu{t) is the distance from helper h to user u at time 

t, and where 

PL{d) = A\ogd-\- B -f Clog0.25/o + XdB • (14) 

In (O, d is expressed in meters, the carrier frequency /o in 
GHz, and XdB denotes a shadowing log-normal variable with 
variance cr^g. The parameters A, B, C and cr^g are scenario- 
dependent constants. Among the several models specified in 
WINNER II we chose the A1 model 1481, representing an 
indoor small-cell scenario. In this case, 3 < d < 100, and 
the model parameters are given by A = 18.7, B = 46.8, 
C = 20, (Tjg = 9 in Line-of-Sight (LOS) condition, or 
A = 36.8, B = 43.8, C = 20, cr|g = 16 in Non-Line- 
of-Sight (NLOS) condition. For distances less than 3 m, we 
extended the model by setting PL(d) = PL (3). Each link is in 
LOS or NLOS independently and at random, with a distance- 
dependent probability pi{d) and 1 —pfld), respectively, where 

1 if d < 3 m 

1 — 0.9(1 — (1.24 — 0.6 logotherwise . 

In the following experiments, d is updated in every scheduling 
time slot, while the random components, XdB and pi are up¬ 
dated every 5 seconds (except when d falls below 3 m, which 
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Fig. 1. Connectivity statistics for uniformely distributed users (left) and 
clustered users (right). From top to bottom: number of helpers per client; 
active link rates; sum of link rates per client; average number of users per 
helper for different total numbers of users in the network (with 10th and the 
90th percentiles). See Section rV-B2l for details. 


forces the link to switch into the LOS mode immediately, to 
maintain a consistent setting). 

Finally, links with a link rate below 2 Mbps in a particular 
scheduling time slot are not used for transmission. We call the 
remaining links active links. 

To visualize the resulting network conditions. Figure [T] 
shows some connectivity statistics. The left column shows 
statistics for users uniformly distributed over the simulated 
area, while the right column illustrates the case of clustered 
users, where users’ distance to the center of the simulated 
area is exponentially distributed with A = 0.2 In 4. The top 
subhgures show the histogram of the number of helpers that 
can serve a client at a particular location. The subhgures in 
the second row show the histogram of link rates over all active 
links. The subhgures in the third row show the histogram of 
the sum of link rates for a client. The bottom subhgures show 


the average number of users served by a helper, for different 
total numbers of users in the network, plus 10th and 90th 
percentiles. 

3) Controller parameters: All experiments are performed 
with different controller parameters, and different numbers of 
users. In all experiments, Kd is set to 0. This is common 
practice in many applications. The reason behind it is that 
when the derivative of the system state is estimated from 
sampled measurements and the measurements are noisy, the 
derivative action amplihes the noise, introducing additional 
jitter in the system variable. In the following, whenever 
appropriate, results are only provided for selected parameter 
values, to improve the readability of the paper. 

For better illustration, we would like to give an intuition 
for the scale of the parameters. For Kp = —0.05, if the buffer 
level is below the target value by 20 seconds (as, e.g., at the 
beginning of a streaming session), and the integral gain is at 
its equilibrium value (which is 1 in our case), then the client 
would try to download the video at twice the playback speed. 
That is, in one second the client would try to download two 
seconds of video, which would get him closer to the target 
value by 1 second. For a deviation of 10 seconds, the download 
rate would be 150% of the playback rate, and so on. 

All simulations were repeated 30 times. 

C. Experimental design 

We evaluate the proposed system in three types of experi¬ 
ments, each of them focusing on certain deployment scenarios, 
such as long-term users with no user churn, short-term users 
with high user churn, and a mix of short-term and long-term 
users. The performance is then compared with the performance 
of the baseline approach, described below. 

Each of the experiments is executed with two different user 
distributions, called uniform and clustered in the following. 
With the former, arriving users are dropped at a random 
location, uniformly distributed over the simulated area. With 
the latter, arriving users are clustered around the center of 
the simulated area, with exponentially distributed distance 
(A = 0.2 In 4). 

Upon joining the network, each user starts to watch a 
randomly selected video from a random point within the video. 
If he arrives at the end of the video, he continues to watch 
from the beginning. 

1) Experiment 1: The hrst experiment is intended to ana¬ 
lyze system’s behavior under constant load (hxed number of 
users) and without user churn (all users are long-term users). 
In particular, all users arrive during an initial arrival phase at 
the begin of the simulation and remain in the network until 
its end. Thus, after the arrival phase, the number of users in 
the network remains constant. 

The initial arrival phase starts at t = 0. The users arrive 
at a rate of 10 users per second until a predehned number 
of users is reached. Then, the simulation continues with a 
constant number of users for another 400 seconds. 

2 ) Experiment 1 *: In order to compare performance with a 
baseline approach, we rerun experiment 1 with the following 
transmission scheduling and quality selection. In each schedul¬ 
ing timeslot, every user is receiving data form exactly one 




































































10 




aj 

O 



time [si 


Fig. 2. Dynamics of one user during an example run of experiment 3. See 
Section rV-DI for details. 


helper, namely the one with the highest Signal-to-Interference- 
plus-Noise Ratio (SINR). (This is not always the closest 
helper, due to the random component in the pathloss.) Further, 
the client selects the video representation with the highest 
media bit rate that is still below the average throughput from 
the last 5 seconds. 

3) Experiment 2: In this setting, the goal is to analyze sys¬ 
tem’s performance under continuous user churn. That is, users 
continuously join and leave the network. As in experiment 1, 
there is an initial arrival phase, during which users arrive at a 
rate of 10 users per second until a certain number of users is 
reached. After that, ’old’ users leave the network, while new 
users join it, at a rate of 2 users per second. 

4) Experiment 3: This experiment is intended to study 
system’s behavior under constantly increasing load, in a de¬ 
ployment scenario with both short-term and long-term users. 
Here, new users continuously join the network at a constant 
rate of 2 users per second, and remain active until the end of 
the simulation run. 


D. Evaluation results 

In this section, we present evaluation results for the four 
types of experiments, described in Section IV-CI We split 
the results according to the considered performance metrics; 
stability, rebuffering duration, start-up delay, mean media bit 
rate, media bit rate fluctuations, and media bit rate fairness. 

Before we look into results for individual metrics, we 
would like to illustrate system behavior based on one example 
run. Figure |2] shows dynamics of one user during a run of 
experiment 3, with Kp = —0.05, Ki = —0.00001, = 0, 

gi,max — 0.1. The experiment runs for 500 seconds, that is, 
in the end there are 1000 users in the network. The plot 
shows the first user in the network, who starts to stream at 
second 0. The top subflgure shows the accumulated link rate to 
all neighboring helpers. The second subflgure shows network 
throughput allocated to the user. In scheduling timeslots that 
corresponds to inter-request delays, no resources are allocated 
to the user and thus his throughput drops to 0. The third 
subflgure shows the selected video representation, the fourth 
subflgure shows the buffer level. Finally, the bottom subflgure 
shows the total time spent in rebuffering. 

1) Stability: One of the issues that needs to be taken care of 
when designing a closed-loop controller is system’s stability. 
In order to study stability, we analyze buffer level statistics 
of each user during each of the simulation runs. In particular, 
we look at maximum and minimum buffer levels, where the 
maximum and minimum operators are first applied to traces of 
individual users, then to resulting per user values, and Anally 
to the whole set of runs for a specific configuration. In the 
following, we present results for experiment 1, with a uniform 
distribution of users across the simulated area. Results for 
other settings, omitted here for the lack of space, are consistent 
with presented findings. Finally, in order for the results not to 
be biased by system behavior during the initial arrival phase, 
we remove the initial arrival phase and the subsequent 100 
seconds from each trace. 

Figure shows the maximum buffer level overshoot (that 
is, the maximum difference between user’s buffer level and 
the target buffer level). As expected, if the integral gain 
coefficient is large, as compared to the proportional gain 
coefficient, the system tends to become unstable. At the same 
time, however, we observe that the conditional integration anti¬ 
windup technique successfully combats this effect, if pi,max is 
sufficiently small. Also the minimum buffer level, depicted in 
Figure 1^ confirms the efficiency of the conditional integration 
technique in avoiding system instability. We also studied the 
mean buffer level. We observed that it is within few seconds 
of the target buffer level, even for unstable configurations, and 
omit it here. 

In the following, we only report results for Kp = —0.05, 
Ki = —0.00001, and gt^max — 0.1, which we confirmed to 
be a stable configuration, and omit results for other configu¬ 
rations. 

2) Rebuffering: One of the main factors influencing QoE 
for video streaming is the amount of time a client spends 
rebuffering video data while the playback is halted. This 
happens when the playback buffer has been drained and the 
next segment does not arrive before its playback deadline. 
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Fig. 3. Stability analysis based on buffer level statistics from experiment 1, see Section IV-D II for details, (a) Maximum buffer level overshoot (difference 
between buffer level and target buffer level), (b) Minimum buffer level. 
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Fig. 4. Mean and maximum total rebuffering per user. See Section IV-D2l for 
details. 
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Figure |4] shows mean and maximum total rebuffering per 
user, where the mean (maximum) is first taken over all users 
of a simulation run, and then over all runs performed for a 
setting. As in the previous section, we remove the initial arrival 
phase and the subsequent 100 seconds from each trace, for 
experiments 1, 1*, and 2 (experiment 3 does not contain an 


initial arrival phase). 

As expected, we observe that the number of users the 
network can accomodate without rebuffering is higher with 
a uniform distribution of users. Moreover, we observe that the 
baseline approach results in significantly higher rebuffering 
values. 

3) Prebujfering (start-up delay): Another critical factor 
influencing QoE is the prebuffering duration, or start-up delay. 
Especially in a mobile context, when users tend to watch 
shorter videos, long start-up delays can be very annoying. 

Eigure |5] shows mean and maximum prebuffering delays 
for experiments 2 and 3, for uniform and clustered user 
distributions. Eor experiment 2, only users who arrived after 
the initial phase are considered. Eor experiment 3, to facilitate 
comparison, the x-axis shows number of users based on the 
user arrival rate of 2 users per second, instead of time. 

With Kp = —0.05 and a target buffer level of 20 seconds, 
new users try to download the first segment at twice the media 
bit rate, that is, within one second. When there are few users in 
the system, the network can satisfy the corresponding through¬ 
put requests and even allocate some additional capacity to the 
individual users. When there are too many users in the system, 
the network cannot allocate the requested capacity for every 
user. When the load is moderate, many user receive exactly 
the requested capacity, resulting in one second prebuffering 
delay, as can be seen in Figure |5] left column. 

4) Mean media bit rate: Figure |6] (top subflgure) illustrates 
mean video quality, represented by mean media bit rate, across 
all users. It shows results for all four experiments, for uniform 
(left column) and clustered (right column) user distributions. 
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Fig. 5. Mean and maximum prebuffering duration (start-up delay) for 
experiments 2 (top) and 3 (bottom). See Section IV-DSI for details. 


As in previous sections, the arrival phase was removed for 
experiments 1, 1*, and 2. In addition, the first 30 seconds 
of each user’s trace were removed from experiments 2 and 3, 
since a user always starts to stream at lowest available quality. 
Because of the latter, results for experiments 2 and 3 do not 
include settings with less than 60 users, where no user remains 
in the network longer than 30 seconds. 

We observe that experiment 1 and 1* exhibit comparable 
average media bit rates for both user distributions. With clus¬ 
tered user distribution, the controller driven approach offers 
slightly better values, especially for small numbers of users. 

5 ) Media bit rate fluctuations: Several studies have shown 
that severe quality fluctuations can dramatically degrade qual¬ 
ity of experience even if the average quality is high. Especially 
in cases where network conditions may change very fast, 
such as in wireless networks, video clients have to implement 
adaptation strategies that avoid to immediately adapt video 
quality to dynamically varying throughput but that only react 
to long-term throughput changes. 

The middle subhgure in Figure |6] shows media bit rate 
fluctuations for all four experiments. As described in Sec¬ 
tion we measure media bit rate fluctuations as percentage 
of segments that were played in a different quality than their 
predecessor. With this dehnition, a value of 1% means that 
an adaptation takes place every 100 segment. With a segment 
duration of 2 seconds, this corresponds to one adaptation in 
200 seconds. 

We observe that the amount of adaptations is up to twice as 
high with the baseline approach as with the controller driven 
approach. 

6) Media bit rate fairness: Finally, the bottom subhgure in 
Figure |6] illustrates fairness by showing the interquartile range 
over all users in a network of their respective mean media 



number of users (unf.)number of users (cist.) 


Fig. 6. From top to bottom: mean media bit rate in [Mbps]; mean percentage 
of segments played in a different representation than their predecessor (quality 
fluctuations indicator); mean interquartile range of media bit rate in [Mbps] 
(unfairness indicator). See Sections 1 V-D4IIV-DS I and I V-D61 for details. 


bit rates. The higher the value, the lower the fairness. We 
observe that, except for the case of clustered users with less 
than 60 users in the network, the controller driven approach 
offers signihcantly better fairness than the baseline approach. 

VI. Conclusion 

In this study, we presented an approach for joint trans¬ 
mission scheduling and video quality selection in small-cell 
networks. The core of the approach is a PID controller, known 
for its simplicity, analytical tractability, and robustness in the 
presence of modeling uncertainties and external disturbances. 
Although the controller in the studied setting is subject to 
saturation due to bandwidth constraints and constraints on 
available media bit rates, we successfully apply an anti-windup 
technique called conditional integration to stabilize the system. 
We additionally apply several heuristics that allow us to 
decentralize the designed mechanism, and specihcally address 
issues that are known to affect QoE. 

We evaluated the performance of our approach under differ¬ 
ent conditions, including uniformely distributed and clustered 
users, as well as different mixes of short-term and long¬ 
term users, and settings with high user churn. The evaluation 
results showed that the developed approach performs well, 
outperforming the baseline approach. 
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Future work includes development of an outer control loop 
for the target buffer level that might be adjusted based on 
user’s individual link quality statistics. 
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